NSF PAR Search | NSF Public Access Repository


PipSwitch: A Circuit Switch Using Programmable Integrated Photonics

Ding, Eric; Singh, Rachee (April 2025, Optica OFC)

Free, publicly-accessible full text available April 2, 2026
Aqua: Network-Accelerated Memory Offloading for LLMs in Scale-Up GPU Domains

Kumar, Abhishek Vijaya; Antichi, Gianni; Singh, Rachee (April 2025, ACM ASPLOS)

Free, publicly-accessible full text available April 3, 2026
Chip-to-Chip Photonic Connectivity in Multi-Accelerator Servers for ML

Kumar, Abhishek Vijaya; Devraj, Arjun; Bunandar, Darius; Singh, Rachee (April 2025, Optica OFC)

Free, publicly-accessible full text available April 2, 2026
A case for server-scale photonic connectivity

Kumar, Abhishek Vijaya; Devraj, Arjun; Bunandar, Darius; Singh, Rachee (November 2024, ACM SIGCOMM HotNets)

Full Text Available
Making Sense of Constellations: Methodologies for Understanding Starlink's Scheduling Algorithms

https://doi.org/10.1145/3624354.3630586

Tanveer, Hammas Bin; Puchol, Mike; Singh, Rachee; Bianchi, Antonio; Nithyanand, Rishab (December 2023, ACM)

Full Text Available
Glowing in the dark: uncovering IPv6 address discovery and scanning strategies in the wild

Tanveer, Hammas Bin; Singh, Rachee; Pearce, Paul; Nithyanand, Rishab (August 2023, USENIX Security)

Full Text Available
Teal: Learning-Accelerated Optimization of WAN Traffic Engineering

https://doi.org/10.1145/3603269.3604857

Xu, Zhiying; Yan, Francis; Singh, Rachee; Chiu, Justin; Rush, Alexander; Yu, Minlan (September 2023, ACM SIGCOMM)

Full Text Available
TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches

Shah, Aashaka; Chidambaram, Vijay; Cowan, Meghan; Maleki, Saeed; Musuvathi, Madan; Mytkowicz, Todd; Nelson, Jacob; Saarikivi, Olli; Singh, Rachee (April 2023, USENIX)

Machine learning models are increasingly being trained across multiple GPUs and servers. In this setting, data is transferred between GPUs using communication collectives such as ALLTOALL and ALLREDUCE, which can become a significant bottleneck in training large models. Thus, it is important to use efficient algorithms for collective communication. We develop TACCL, a tool that enables algorithm designers to guide a synthesizer into automatically generating algorithms for a given hardware configuration and communication collective. TACCL uses a novel communication sketch abstraction to get crucial information from the designer to significantly reduce the search space and guide the synthesizer towards better algorithms. TACCL also uses a novel encoding of the problem that allows it to scale beyond single-node topologies. We use TACCL to synthesize algorithms for three collectives and two hardware topologies: DGX-2 and NDv2. We demonstrate that the algorithms synthesized by TACCL outperform the Nvidia Collective Communication Library (NCCL) by up to 6.7x. We also show that TACCL can speed up end-to-end training of Transformer-XL and BERT models by 11%–2.3x for different batch sizes.
Full Text Available
PredictRoute: A Network Path Prediction Toolkit

Singh, Rachee; Tench, David; Gill, Phillipa; McGregor, Andrew (January 2021, SIGMETRICS 2021)

Full Text Available
Characterizing the Deployment and Performance of Multi-CDNs

Singh, Rachee; Dunna, Arun; Gill, Phillipa (October 2018, Proceedings of the ACM SIGCOMM Internet Measurement Conference)

Full Text Available
